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Abstract 

Assume that individuals alive at time t in some population can be ranked in such a way 
that the coalescence times between consecutive individuals are i.i.d. The ranked sequence 
of these branches is called a coalescent point process. We have shown in a previous work 
[14j that splitting trees are important instances of such populations. 

Here, individuals are given DNA sequences, and for a sample of n DNA sequences 
belonging to distinct individuals, we consider the number S n of polymorphic sites (sites at 
which at least two sequences differ), and the number A n of distinct haplotypes (sequences 
differing at one site at least). 

It is standard to assume that mutations arrive at constant rate (on germ lines), and 
never hit the same site on the DNA sequence. We study the mutation pattern associated 
with coalescent point processes under this assumption. Here, S n and A n grow linearly as n 
grows, with explicit rate. However, when the branch lengths have infinite expectation, S n 
grows more rapidly, e.g. as n ln(n) for critical birth death processes. 

Then, we study the frequency spectrum of the sample, that is, the numbers of poly- 
morphic sites/haplo types carried by k individuals in the sample. These numbers are shown 
to grow also linearly with sample size, and we provide simple explicit formulae for muta- 
tion frequencies and haplotype frequencies. For critical birth-death processes, mutation 
frequencies are given by the harmonic series and haplotype frequencies by Fisher's logarith- 
mic series. 
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1 Introduction 



1.1 The coalescent point process 

Splitting trees are those random trees where individuals give birth at constant rate b during 
a lifetime with general distribution A(-)/6, to i.i.d. copies of themselves (see |12j). where A is 
a positive measure on (0,oo] with total mass b called the lifespan measure. In [J3], we have 
shown that if the splitting tree is started from one individual with known birth time, say 0, 
and known death time, then individuals alive at time t can be ranked in such a way that the 
coalescence times between consecutive individuals are i.i.d. 
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Figure 1: A coalescent point process for n = 16 individuals. 

Specifically, let Nt be the number of individuals alive at time t. The process (Nt',t > 0) is 
a (homogeneous, binary) Crump-Mode-Jagers process, and is not Markovian unless A has an 
exponential density or is a point mass at oo. To these Nt individuals, give labels 0, 1, . . . , Nt — 1 
according to the (unique) order complying with the following rule : 'any individual comes before 
her own descendants, but after her younger siblings and their descendants'. For any integers i, k 
such that < i < i + k < Nt, we let Ci^ + k be the coalescence time (or divergence time) between 
individual i and individual i + k, that is, the time elapsed since the lineages of individual i and 
i + k have diverged. Also define Hi+\ := Ci i+i. Then recall from [H] that for a splitting tree, 

Cj,i+fc = max{F i+ i, . . . , H i+k } (1) 

and conditional on {Nt ^ 0}, the sequence (Hi; 1 < i < Nt — 1) has the same law as a sequence 
of i.i.d. r.v. killed at its first value >t. As a by-product, we get that the law of Nt conditional 
on {N t 7^ 0} is geometric. 

The aforementioned property comes from the fact that the jumping contour process of the 
splitting tree is a Levy process X = {X s ; s > 0) with Levy measure A and drift coefficient — 1. 
Then the excursions of the contour process between consecutive visits of points at height t are 
i.i.d. excursions of A. As a consequence, the (Hi) are also i.i.d., and their common distribution 
is that of H' := t — inf s X s , where X is started at t and killed upon hitting {0} U (t, +oo). Note 
that all branch lengths but the last one are distributed as some r.v. H which is H' conditioned 



2 



to be smaller than t. The distribution of H 1 can be expressed in terms of a nonnegative, 
nondecreasing, differentiable function W, called the scale function of X, such that VF(0) = 1 

F(H> > x) = ^ x > 0. (2) 

The scale function W is characterised by its Laplace transform (see e.g. j6]) 



oo 



dx e~ Xx W(x) = ( A — / A(ds)(l - e~ AX ) ) . (3) 



o 



A(dx){l - e~ Xx )) 
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From now on, with no need to refer to the framework of splitting trees, we will consider the 
genealogy of what we call a coalescent point process (originating from [T7j where A(dx) = 
b 2 exp(— bx)dx) : 

1. let H\, H2, ■ ■ ■ be a sequence of independent random variables called branch lengths all 
distributed as some positive r.v. H, and set Hq to equal +00. 

2. the genealogy of the population {0, 1, 2, . . .} is given by ([I]). 
We will stick to the notation 

W(x) := — — r x > 0. 

v 1 P(H > x) 

It will always be implicit that a sample of n individuals refers to the first n individuals 
{0,l,...,n-l}. 

Remark 1 In the case of splitting trees, conditional on {Nt 7^ 0} ; Nt is geometric with success 
probability ¥(H' > t), and conditional on {Nt = n}, the branch lengths (Hi; 1 < i < n — 1) 
are i.i.d. with distribution ¥(H' £ • \ H' < t). In what follows, we will repeatedly refer to 
the genealogy of a splitting tree with n leaves by setting the r.v. H to equal H' , without the 
conditioning (i.e. t — > 00). In the subcritical case, this amounts to considering quasi- stationary 
populations, which are those populations conditioned to be still alive at time t, as t — > 00 (see 
e.g. [15]). Another possibility would be, as in JJ|/ ; to give a prior distribution to the time t of 
origin, and condition the whole tree on {Nt = n}. Then as n — > 00, the posterior distribution 
oft goes to 00, and we would be left with a (possibly different) distribution of H charging the 
whole half-line. 

Remark 2 No distribution of edge lengths can make the coalescent point process coincide with 
the Kingman coalescent ]15tf . Indeed, here, the smallest branch length in a sample of n individ- 
uals is the minimum of n — 1 i.i. d. random variables, whereas in the Kingman coalescent, it is 
the minimum of n(n — l)/2 i.i.d. random variables (with exponential distribution). 

Our goal is to characterise the mutation pattern for samples of n individuals, mainly as n 
gets large. We specify the mutation scheme in the next subsection. 

Works studying mutation patterns arising from random genealogies are numerous. Muta- 
tion patterns related to populations with fixed size (Wright-Fisher model, Kingman coalescent) 
are well-known and culminate in Ewens ' sampling formula (see [9] for a comprehensive account 
on that subject). More recent works concern mutation patterns related to more general coa- 
lescents [U [16] , to branching populations [HE], or to both [5] . 
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1.2 Mutation scheme 



We adopt two classical assumptions on mutation schemes from population genetics (see e.g. 

1. mutations occur at constant rate 9 on germ lines, 

2. mutations are neutral, that is, they have no effect on birth rates and lifetimes. 

As is usual, we assume that mutations are point substitutions occurring at a single site on the 
DNA sequence, and that each site can be hit at most once by a mutation. This last assumption 
is known as the infinitely-many sites model (ISM). Instances of DNA sequence are called alleles 
or haplotypes, so that under the ISM, each mutation yields a new allele. Without reference 
to DNA sequences, this last assumption by itself is known as the infinitely-many alleles model 
(IAM). 

Specifically, we let (Vi;i = 0,1,2...) be independent Poisson measures on (0, oo) with in- 
tensity 9 (cf. assumption 1). For each i we denote the atoms of V% by in < li2 < ■ ■ ■ and call 
them mutations. Now let H\,H2, ... be an independent coalescent point process (cf. assump- 
tion 2). In agreement with the genealogical structure of a coalescent point process explained 
in the beginning of this section, we will say that individual i + k carries (or bears) mutation 
£ij if k > and 

m&x{H i+1 , . . . H i+k } < £ij < Hi, 

where we agree that max0 = and Hq = +oo. The second inequality is trivially due to the fact 
that we throw away all atoms £y such that Hi < £ij. The set of mutations that an individual 
bears is her allele or her haplotype, or merely her type. 

For a sample of n individuals, we call S n the number of polymorphic sites, that is, the number 
of mutations {£%j\ < i < n — 1, j > 1) that are carried by at least one individual and at most 
n — 1. Formally, this yields 

S n = Card{^j < H it 1 < i < n - l,j > 1} + Card{£ j < maxjiii, . . . , H n -i},j > 1}. 

Further, we define S n (k) as the number of mutations carried by k individuals in the sample. 
In particular, 

ri-l 

Sn = ^ 1 S n (k). 

k=l 

The sequence (S n (l), . . . , S n (n — 1)) is called the site frequency spectrum of the sample. 

Similarly, we call A n the number of distinct haplotypes in a sample of n individuals, that 
is, the number of alleles that are carried by at least one individual, and A n (k) as the number 
of alleles carried by k individuals. In particular, we have 

n n 

A n = ' s ^A n (k) and ' s ^kA n (k) = n. 

k=l k=l 

The sequence (A n (l), . . . ,A n (n)) is called the allele frequency spectrum of the sample. 

Remark 3 One always has the inequality S n > A n — 1. Indeed, apart from the ancestral 
haplotype, each new haplotype independent of at least one new mutation. 
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Figure 2: A coalescent point process with mutations for a sample of n = 9 individuals. Site a 
is not polymorphic because no individual in the sample carries a mutation at that site; site g 
is not polymorphic because all individuals in the sample carry the mutation at that site. The 
number of polymorphic sites is S n = 6. The number of distinct haplotypes is A n = 5. 



1.3 Examples of coalescent point processes 

Before going into the main part of this work, we provide a few simple examples of coalescent 
point processes derived from splitting trees, in part for application purposes. 

Yule tree. When A is a point mass at oo, the splitting tree is a Yule tree, and (Nfjt > 0) is a 
pure-birth binary process with birth rate, say a. Then W(x) = e ax , and H has an exponential 
distribution with parameter a (see [TT]). 

Birth— death process. When A has an exponential density, (A^; t > 0) is a Markovian birth- 
death process with (birth rate b and) death rate, say d. Then it is known (see [TJj for example) 
that if b ^= d, then 

„ r , , d-beO-V* 

W{x) = — x > 0, 

whereas if b = d =: a, 

W(x) = 1 + ax x > 0. 

Notice that in the subcritical case (b < d), H can take the value oo with probability 1 — (b/d), 
which is due to the constrained size of quasi-stationary populations (see Remark[T]). Elementary 
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calculations show that H conditioned to be finite has the same law as the branch length of a 
supercritical birth-death process with birth rate d and death rate b. 



Consistency and sampling. The genealogy associated with a coalescent point process is 
consistent in the sense that the genealogy of a sample of n individuals has the same law as 
that of a sample of n + 1 individuals from which the last individual has been withdrawn (in 
the splitting tree framework, the last individual is the individual who has no descendants in 
the sample, and whose ancestors have no elder sibling with descendants in the sample). This 
property would not hold any longer if the withdrawn individual was chosen at random. 

On the other hand, if all individuals in the population are censused independently with 
probability c, then the genealogy of the census is still that of a coalescent point process. 
Indeed, the typical branch length is H" , where 

H" — max{i?i, . . . , Hk}, 

and K is an independent (modified) geometric r.v., that is, F(K = j) = c(l — c) J . As a 
consequence, 

-J— := F(H" >i) = l- Vc(l- C y- l ¥(H < x) j x > 0. 

w c (x) m 

This last equation also reads 

W c = 1 - c + cW. 

Applying this Bernoulli sampling procedure with intensity c to the previous examples yields 
the following elementary results. 

— the census of a Yule population has the genealogy of a birth-death process population, with 

birth rate ac and death rate a(l — c) 

— the census of a birth-death process population has the genealogy of another birth-death 

process population with birth rate be and death rate d— 6(1 — c). In particular, censusing 
a critical birth-death process population with rate b = d =: a amounts to replacing a 
with ac. 



Infinite lifespan measure. Actually, everything that was stated about splitting trees still 
holds if the lifespan measure is infinite, provided the lifespans of children remain summable, 
that is /^(l ^ r)A(dr) < oo. In particular, one still has W(0) = 1, and the number of 
individuals alive at a fixed time remains a.s. finite. 

On the contrary, it is a completely different task to define the real tree whose jumping 
contour process is a Levy process with no negative jumps but infinite variation (see [6]). 
However, in our setting, this only requires replacing the coalescent point process i?i,i?2, . . . 
with a true Poisson point process with intensity measure ds u(dx), where v is a cr-finite positive 
measure defined as the push forward of the excursion measure of X away from {t} by the 
function which maps an excursion e into t — inf s e s . Similarly as in the finite variation case, 

W(x) := 1 x > 0. 

u({x,oo)) 
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In the Brownian case, for example v(dx) = x~ 2 dx (again, see p2]), that is, W(x) = x. 

Here, the analogue of Bernoulli sampling with intensity c consists in taking the maximum 
H" of the point process on an interval with exponential length of parameter c (instead of a 
geometric length). Now c can take any positive value. Standard calculations then yield 

' F{H" >x) = l £ - x > 0, 



W c (x) c + u((x, oo)) 
so that 

W c = 1 + cW. 

As far as splitting trees with infinite variation are concerned, we will only focus on the stable 
case, where W(x) = x a ~ l for some a G (1,2], the Brownian case corresponding to a = 2. In 
particular, we see that the Brownian coalescent point process censused with intensity c has the 
same law as the coalescent point process associated with a critical birth-death process with 
rate c. 

1.4 Statements, outline, examples 

Our results regarding polymorphic sites are stated in Section [2j 

In the first two subsections of Section [21 we assume that ~E(H) is finite. Theorem 12.11 
provides a law of large numbers and a central limit theorem (if H has a second moment) on 
the number S n of polymorphic sites. In particular, 

lim — = 0E(H) a.s. (4) 

n— »oo n 

We also give exact explicit formulae for the expectation of the number S n {k) of mutations 
carried by k individuals in a sample of n. 

In the third subsection, we make the less stringent assumption that E(min(i?i, -H2)) is 
finite. Theorem 12.31 then gives the asymptotic behaviour of the site frequency spectrum of 
large samples via the following a.s. convergence 

s n (k) _ r° dx t 1 n*- 1 

n^L n J W(x) 2 V W(x) J 1 ' 

In the fourth subsection, we treat the case of stable laws with parameter a, that is, W is 
given by W(x) = 1 + cx Q_1 , where a G (1,2] and c is some positive parameter that can be 
interpreted as a sampling intensity. Since here E(i?) = 00, the only result holding in the stable 
case is dS}, and only for a > 3/2. Theorems 12.41 and 12.51 give the asymptotic behaviour of S n . 
When a = 2, S n /nln(n) converges in probability (to 6/c), and when Sn/n 13 converges 

in distribution, with = l/(a — 1). 

Section [3] displays our results regarding distinct haplotypes. The trick is to characterise the law 
of the branch length H d of the next individual bearing no mutation other than those carried 
by, say, individual 0. Proposition 13.11 does this as follows 



F(H e > x) 



W e {x) = l+( W'{u)e' eu du x > 0. 
Jo 
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Theorem 13.21 states a.s. convergences without moment existence assumptions. Specifically, 

lim ^ = E(l-e- dHB ) a.s., (6) 



n^oo n 

and the allele frequency spectrum for large samples is given by the following a.s. convergence 

n^oo n J W e {x) 2 \ W e {x) J 

Before ending this last subsection, we want to point out that in some cases, more explicit 
formulae can be computed. First, for the Yule process with birth rate 1, (or with parameter a, 
but after replacing 9 with a0), that is, when W{x) = e x , one gets easily 

lim — = 9 and lim 



rwoc n n— >oo n k{k + 1) 

Computations are not as straightforward for the number of haplotypes. Second, for the critical 
birth-death process with birth rate 1 (or with parameter a, but after replacing 9 with a9), that 
is, when W(x) = 1 + x, one gets 

lim — - = 9 and lim - = — . 
moo n ln(n) n— >oo n k 

In addition, 

lim ^L = 9hx + and lim ^® = °- (1 + 9y k . 

n— >oo n n^oo n k 



Remark 4 It is amusing to notice that the rescaled number A n (k) of haplotypes with k rep- 
resentatives is also the probability that a species has k representatives in Fisher's log-series of 
species abundance J71]/ . In Fisher's model, a given species has an unknown density which is 
assumed to be drawn from a Gamma distribution with parameter a. As a result of Bernoulli 
sampling in a large population, it is then assumed that given the value d of this density, the 
number X of individuals spotted from this species is Poisson with parameter pd, where p is the 
sampling intensity. It can then be shown that as a j ; conditional on {X > 1} (since at least 
one individual must be spotted for the species to be recorded), F(X = k) goes to (7(1 + 1/ p)~ k /k, 
for some normalising constant C . 

Remark 5 In a coalescent point process, divergence times are on average deeper than in the 
Kingman coalescent (our trees are more 'star-like'). This forbids convergence of our statistics 
without rescaling (by the sample size n or by nln(n) ). In particular, notice that the asymptotic 
proportion of individuals in a cluster of size greater than K, i.e. \\m n n~ l ^2, k>K A n (k), van- 
ishes as K grows to oo . This shows that the largest cluster in a sample of n has neglectable 
size w.r.t. n, which contrasts with the Kingman coalescent, where the allele frequency spectrum 
is given by Ewens' sampling formula (see J$l liOj/). As n — > oo, the numbers of haplotypes 
A n (k) carried by k individuals J3]/ converge to independent Poisson r.v. with parameter 9/k, 
and the i-th eldest haplotype fSjj is carried by approximately P^n individuals, where (Pi\i > 1) 
is a Poisson-Dirichlet r.v. 
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2 Number of polymorphic sites 



Results for polymorphic sites depend on integrability assumptions on H. Of course these are 
always fulfilled if the time t when the population was founded is known, since then H <t a.s. 
We will see that the critical assumptions are either E(min(i?i, #2)) < °°, or the more stringent 
K(H) < 00. Notice that the first assumption is equivalent to the integrability of 1/W 2 , and 
the second one to the integrability of 1/W. 

2.1 Law of large numbers and central limit theorem 

Recall that S n is the number of polymorphic sites in the sample of n individuals. 

Theorem 2.1 IfE(H) < 00, then 

lim n~ 1 S n = 8E(H) a.s. and in L 1 . 

n^oo 

If in addition E(H 2 ) < 00, then 

y/n^Sn-OEiH)) 

converges in distribution to a centered normal variable with variance 9K(H) + 9 2 Var(H). 
Proof. Set Y n := max{i/i, . . . , i? n _i}. Recall from the Introduction that 

n-l 

S n = Qi + R n , 

i=l 

where Qi is the number of points of the Poisson point process Vi in (0, Hi), and R n is the number 
of points of the Poisson point process Vq m (0,Y n ). By the strong law of large numbers, we 
know that 

n-l 

lim n^S^Qi = 6E(H) a.s. and in L 1 , 

n^oo — ' 
i=l 

so we need to prove that 

lim n~ l R n = a.s. and in L 1 . 

n-+oo 

Now because R n /Y n converges to 6 a.s. and in L 1 , it is sufficient to prove that 

lim n~ l Y n = a.s. and in L 1 . 

n^oo 

Because Y n <J2^H h 

lim sup n~ l Y n =: Y < 00 a.s. 

By the 0-1 law, Y is not random. To prove that Y = 0, we let Y^ (resp. Yn 2 ^) be the maximum 
of the H^s indexed by odd (resp. even) numbers. Then it is clear that Y n = max(yi 1 - ) , Y^), 
and that n -1 !^ 1 -* as well as n -1 !^ 2 -* both converge to Y/2. This shows that Y = Y/2, so that 
Y = 0. 
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For convergence in L , pick any x > 0, and notice that 

n _1 E(y n ) = n- l ^{Y n ,Y n <x)+n- l ^{Y n ,Y n >x) 

< n -l x + n -l E (j^ Hil{Hi>x}\ 

< n^x + E^^H > x). 

Since E(H) < oo, this last inequality shows that n _1 E(l^) vanishes as n — > oo. 

Now we prove the central limit theorem for S n . It is elementary to compute Var(Qi) as 
9E(H) + 9 2 V&i(H), so by the classical central limit theorem applied to the sum of Qi's, we 
only have to prove that R n /^/n converges to in probability. For any A > 0, 

E (exp [-XRn/y/nj) = E (exp [~9Y n (l - e - A ^))) , 

which shows it is sufficient to prove that Y n j^fn converges to in probability. As previously, 
we write 

n- x E(Y%) = n- 1 K(YlY n <x)+n- 1 E(YlY n >x) 

< n ~ V + n -i E ^ H?l {Hi>x} j 

< n- 1 x 2 + E{H 2 ,H > x) . 

Thus, convergence of Y n j^fn to holds in L 2 , and subsequently, it holds in probability. □ 



2.2 Explicit formulae for the expected frequency spectrum 

Recall that S n (k) denotes the number of mutant sites that are carried by exactly k individuals 
in the sample of n individuals (and since we only count polymorphic sites, S n (n) = 0). 

Theorem 2.2 For all 1 < k < n - 1, 

VWi» = e [ dx (i - ^) (n^l + ^) , 

which is finite if and only ifE(H) < oo. Then in particular, 

r°° rlr / 1 \ k ~ l 

lim n- 1 E(5 n (fe)) = 9 / — 1 - — — 
n^oo v " v J Q W{x) 2 V W{x) / 

Remark 6 Taking the sum over k in the r.h.s. of the last equality of the theorem, one gets 
6E(H), so that, thanks to the L 1 convergence in Theorem \2.1\ 

n-l 

lim ji^Ve^)) = 9E(H) = V lim n -1 E(S n (ife)). 

k=l k>l 
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Before giving a proof of the previous theorem, we want to make a point that will also be 
useful in the next subsection. For any tree with point mutations, a mutation is carried by k 
individuals if and only of it is in the part of the tree subtending k leaves. Then in any given 
tree with edge lengths and Poisson point process of mutations (with rate 0) independent of 
the genealogy (as in our situation), the expectation of the number of mutations carried by k 
individuals is 9L k , where L k is the Lebesgue measure of the part of the tree subtending k leaves 
(i.e., tips). In our setting, we will call Lk(n), for k < n — 1, the Lebesgue measure of the part 
of the tree subtending k tips among individuals {0, 1, . . . , n — 1}, so that 

E(S n (fc)) = 9E(L k (n)) l<fc<n-l. 

Remark 7 The last equality along with more specific considerations given in the next sub- 
section provide a less analytic and more transparent proof than the proof we give hereafter. 
However, we stick to it for the interest of the method itself. 

Proof of Theorem 12.21 We set N(x) to be the smallest i > 1 such that Hi > x. The proof 
relies on the fact that 

E(5 n (A;)) = lim 9E(L k (N(x)) | N(x) = n) 1 < k < n - 1. 

x— »oo 

On the event {N(x) = n}, we will need to extend the definition of L k (N(x)) to k = n, as being 
the Lebesgue measure of the part of the tree up to time —x subtending all tips {0,1, ... ,n — 1}, 
that is, L n (N(x)) = x — maxj = i r .. jn _x H{. 

For editing reasons, we will prefer to write F(x) = ¥(H > x), instead of 1/W(x). Since F 
is a.e. differentiable and our goal is to let x — > do, we can set f(x) := —F'[x) without loss of 
generality. We let H denote the branch length Hj^m, and we set 

N : = min{fc > 1 : H^f/ x )+k > x + dx}, 

as well as the Lebesgue measure of the part of the tree subtending k leaves among individuals 
{N(x),N{x)+l, . . . , N(x+dx)-l}. Note that (N,L k ,H) are independent of (N(x), L h (N(x))); 
that H is distributed as H conditional on {H > x}; and that (N,L k ) is independent of 
H and distributed as (N(x + dx), L k (N(x + dx)). Next observe that if H > x + dx, then 
N(x + dx) = N(x) and Lk(N(x + dx)) = Lk(N(x)), except if k = n, where by definition 
L n (N(x + dx)) = L n (N(x)) + dx. On the other hand, if H G dx, Lk(N(x + dx)) is the sum of 
measures of edges subtending k tips in {0, 1, ... , N(x) — 1} with measures of edges subtending 
k tips in {N(x), . . . , N(x) + N — 1}. This reads 

L k (N(x + dx))l {N ( x+dx)=n} = ^{Hyx+dx^iN^n} (L k (N(x)) + dxl k=n ) 

+ 1 {H<x+dx} Yl Vm^'} 1 ^"-;} ( L k( N ( x )) + L k (N)) , 

where we have used the extension of the definition of L k specified earlier (cases when k = j or 
k = n — j in the sum) . Now set 

U Kn {x) :=E(L k (N(x)),N(x) =n). 
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By the independences stated previously, taking expectations, we get 

„ n— 1 r. 

U' k , n (x+) = -U k , n (x)l(x) + l fe=n P(iV(x) = k) + 2^ U kd (x)¥(N(x) = n- j)L(x). 

3=1 

Setting 

V k (x;s):=J2U k ,n( x ) sn *€[0,1), 

n>k 

and observing that \U k , n (x)\ < nx, and (so) that \U' kn (x)\ < c(x)n 2 for some positive c(x 
independent of k and n, we get 

^(x; S ) = -|(x)y fe (x; S ) + F(N(x) = k)s k + 2±{x) £ S " £ U w (i)P(JV(i) = n - j). 

n>fe i=l 

Since U k j(x) = when j < k — 1, the last term equals 

„ n— 1 „ 

4 (X) ^ s n £^(*)W*) = n-j) = 2|(x)^[/ fcj (xy £ S ^P(7V(x) = n- 

n>k+l j=k j>k n>j+l 

= 2^(x)Vk(x;s)^s n nN(x)=n). 

n>l 

As a consequence, we get the following differential equation 

-^(x; a) = G fc (x; S )y fc (x; a) + F(N(x) = k)s\ 

where we have put 

G k {x-s) := (2E ^)-l)l(x). 
Now since P(iV(x) = k) = F(x)(l - F(x)) fc_1 , we easily get 

[ X G k (y-s)dy = lJ Hj ' ] 
Jo 

This allows us to integrate the differential equation in V k (.;s) to finally arrive at 

.) = f (i - Ffe)) " 1(i - 8 + sF( " ))2 *■ 

With the shortcuts u := 1—F(x) and t> := 1 — F(y), and using the series expansion of (1— us)' 
we get 

V k {x;s) = s k {l-u) [ X v k ' l {l - ua) 2 y\V'" V" 1 dy. 

■/o 

It is elementary algebra to compute the following equality 

(i - vsf ^2 jV~ V'- 1 = i + ^2 sjuj ~ 2 U( u - v ? + u2 - v 2 ) > 
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L(l - s + sF(x) f 



which yields 

V k (x; s) = s k (l - u) f X v k ~ l dy + (1 - u) V l* v^s^u^ 2 (j(u - vf + u 2 - v 2 ) dy. 
Jo J > 1 Jo 

Identifying this entire series with the definition of V k , we get for all 1 < A; < n — 1, 

U k , n (x) = F(x)(l - F(x)) n - k ~ 2 f (1 - F(y)) k ~ 1 x 

Jo 

x ((n - k)(F(y) - F(x)) 2 + (1 - F(x)) 2 - (1 - F(y)) 2 ) dy. 

As a consequence, 

E(L fc (W(aO) | iV(x) = n) = (1 - F(x))-^ 1 f (1 - F(y)) k ~ l x 

Jo 

x ((n - k)(F(y) - F(x)) 2 + (1 - F(x)) 2 - (1 - F(y)) 2 ) dy. 
which, by Beppo Levi's theorem, converges, as x — ► oo, to 

poo 

e-^Snik)) = / (1 - F(y)) k ~ 1 ((„ - £;)F(y) 2 + 1 - (1 - F(y)) 2 ) dy, 
■/ o 

and this finishes the proof. □ 

2.3 Site frequency spectrum of large samples 

Here, we assume that E(min(i?i, H2)) < 00, that is, l/W 2 is integrable. 

Theorem 2.3 For all 1 < k < n — 1, i/ie following convergence holds a.s. (and in L 1 as well 
ifE(H) < 00) 

limn^Snik) = eE((min{H u H k+1 }-max{H 2 ,...,H k }) + ) 

r dx ( ^_\ k ' L 

Jo wW\~W(x)) ' 

Proof. Reasoning similarly as in the previous subsection, we see that a point mutation 
occurring on branch i is carried by k individuals if and only if it is carried by individuals 
i, i + 1, . . . , i + k — 1, and by no one else. This happens if and only if this mutation, correspond- 
ing to the atom £ij , say, of Vi , has 

max{idj + i, . . . , < £ij < Hi, 

for the mutation to be carried by individuals i, i + 1, . . . , i + k — 1, along with 

£ij < H^ k , 

for the mutation not to be carried by others. More formally, we set T the space of point 
processes on (0,oo), and F k the set of (A; + l)-dimensional arrays with values in T x (0,oo). 
Next, for any S G F k , written as S = ((po,xo), ■ ■ ■ 1 (Pk,%k)) we define 

G(S) := Card(p n (max{xi, . . . , x k -i}, min{x , x k })) , 
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where it is understood that the interval (a, b) is empty if a > b. Then the number of mutations 
carried by k individuals among the first n can be written as 

n—k 

S n (k) = Y,G(E l ), 

where 

Si := ((Vi,Hi), . . . , (Vi + k,H i+ k)) 

and, for the last term of the sum to be correctly written, H n is set to +00 (as Hq). Next, 
observe that 

E(G(Si)) = 6E ((mm{H lt H k+1 } - max{# 2 , . . . , H k }) + ) , 

so that Cr(Ex) is integrable (assumption stated before the theorem). Now for any < r < k, 
the random values G(Hj), for % such that i = r [k + 1] (standing for mod {k + 1)), are i.i.d. and 
integrable, so by the strong law of large numbers, we have the following a.s. convergence 

lim n- 1 G &) = rXT E ( G ( H i))- 

0<i=r[k+l]<n-k 

Actually, the convergence would also hold in L 1 if we had discarded mutations carried by 
individual and individual n — k, which involve terms that are not integrable if K(H) = 00. If 
K(H) < 00, then convergence holds in L 1 . Summing over r these k + 1 equalities, we get the 
convergence of n~ 1 S n (k) to E(G(Hi)), and 

E(G(Sl)) = 0E((min{F 1 ,^ +1 }-max{ J ff 2 ,..., J ff fc })+) 

/>oo 

= 6>E / dx lx<mm{H 1 ,H k+1 } ^-x>max{H 2 ,...,H k } 
JO 
/•oo 

= dx¥(H > x) 2 F(H < x)* -1 , 
Jo 

which ends the proof. □ 
2.4 Stable laws 

Here, we tackle the case when H is in the domain of attraction of a stable law, which happens 
in particular for a splitting tree whose contour process is a stable Levy process with no negative 
jumps with index a S (1,2]. If such a population is censused with intensity c > then the 
corresponding function W (see Introduction) is 

W(x) = 1 + cx"" 1 x > 0. 

From now on, we will assume that W has the form given in the foregoing display. Recall that 
1/W(x) is the probability that a branch has length greater than x. Observe that here H is not 
integrable, so that Theorems 12.11 and !2.2l do not apply. However, asymptotic results for the site 
frequency spectrum of large samples given in Theorem 12.31 apply for a > 3/2. 
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2.4.1 Brownian case 

Here, we assume that a = 2, which corresponds both to a (censused) Brownian population and 
to the (censused or not) population of a critical birth-death process. 

Theorem 2.4 When W(x) = 1 + cx, we have the following convergence in probability 

lim r? a = °/ c - 

n— >oo n m(n) 
Proof. Recall that is to be written as 



n-1 



Sn — ^ ^ Qi ~l" ^rti 



i=l 



where Qi is the number of points of the Poisson point process Vi in (0,i7j), and i? n is the 
number of points of the Poisson point process in (0, Y n ), where Y n = maxjiii, . . . , i? n _i}. 
Now observe that 



¥(Y n > enln(n)) = 1 - ( 1 - — ) 

\ l + cenm(n)/ 



n-1 



which vanishes as n — ► oo, so that Y n /n\n(n) converges to in probability. This implies in 
turn that R n /n\xi{n) also converges to in probability. As a consequence, we can focus on the 
sum of QiS. Pick any A > and check that 



E (^-^4yE*j=( E (« P -«(i-e-v»Mn»))) 

We are bound to study the behaviour of E(exp —yH) as y — ► 
E(exp -yiJ) = 1 - y / ^-r^ cte 



= 1 - y / — ; <iw 

lo y + cu 



= 1-y/ — ; du + y ■ du - yc' 1 ln((y + c)/y) 

A y + cu J y + cu 

= l + c- 1 yln(y) + 0(y), 

where 0(y)/y is bounded near 0. Setting u n := (l - e^M")), there is a vanishing sequence 
v n such that 

E^exp-A— = (l + C - 1 n n lnM+OK)) n (l + ! ; n ) 
\ nln(n) / 

= exp (c~ 1 nu n ln(u n ) + 0(nu n )) (1 + u n ), 
which converges to exp(— \9/c). □ 
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2.4.2 Stable case a^2 

Here, we assume that W(x) = 1 + ex" -1 , for some a € (1, 2). 

Theorem 2.5 When W(x) = 1 + cx a ~ l , we have the following convergence in distribution 



_S„ 

n — »oo 77, 1 



lim 1/(^-1) 



where (Zt;t > 0) is the stable subordinator with Laplace exponent A i— > c 1 £ * 1 A a e is an 
independent exponential r.v. with parameter 1, and if is defined by 



<p{x) = x l ~ a e~ x + / dss l ~ a e~ s x > 0. 
■/ o 

Remark 8 Observe that if decreases on (0, oo) from +oo to a positive limit, equal to r(2 — a). 
Also, recall that S n = Y17=i Qi + ^n> where R n is the extra contribution from the maximum 
branch length. Then it is possible to see by the same kind of proof as that of the theorem, that 
Y17=i Qi converges in distribution to ^r(2-a) • This indicates that, opposite to the Brownian 
case, the (double) contribution of the maximum branch length is not negligible here. 

Proof. Let us compute the limiting distribution of n -1 ^" -1 ) (Y n + ^2™=i Hi), where Y n = 
maxji^i, . . . , i? n _i}. Set (3 := 1/(q — 1), as well as 

/n(A) := E ^exp -An"" (v n + £ H^j j . 

Then 

/„(A) = J™ HYn € dz)e-^ z (E (e-^-^))"" 2 , 
where H' z has the law of H conditioned on being smaller than z. Next, we have 

p(y sdz)-( cza " X' 2 c{n - l){a - l)za " dz z>0 

and 

c{a - l)x a ~ 2 1 + cz a - 1 



F(H' e dx) = -j- 1NO = — dx < x < z, 



so we get 



r 

In(X) = / 
J 



(l + cz"- 1 ) 2 \Jo (l + cx"- 1 ) 2 
Changing variables, this also reads 

/ n (A) = c-^l - - ljA"- 1 / dv ? 

V 7 V Jo (l + n- 1 c- 1 A«- 1 v 1 -«) 2 



J n (v;X) 



n-2 
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where 



J n (v;X) = {a — l)cn)}~ a l du 

Jo 



1 + craA 1 -^"- 1 



o (1 + cnA 1 -"^- 1 ) 2 
v r v p -u 
du ■ 



o Jo 



1 + cnX 1 " 01 ^- 1 



e~ v f v e~ u 

1 1 + cnX 1 - a v a ~ 1 Jq dU 1 + cnX 1 ' a u a ~ 1 
l-n^K^v-X), 



where K n (v;X) is positive and converges to c 1 X a l <p(v) as n — > oo. By the Lebesgue conver- 
gence theorem, we get the convergence of I n (X) to 



/•oo 

c-V-l)A Q_1 / dvv~ a e~ 2v e^-c^X*' 1 ^)). 
Jo 

Integrating by parts with ip'(v) = (1 — a)w _a e - ' 1 ', we finally get 

/■oo 

lim / n (A) = / dt;e- 1 'exp(-c- 1 A^V(«))- 

n->oo 7 Q 

The last step is the same as in the foregoing proof, that is 

lim Efexp-An- 1 /^ 1 ^ = lim E ( exp-6> ( 1 - e -^- 1/(a - 1> ) I y n + V" Hl 



% 

i=l 



lim J n (0A) 

n— +oo 
/■oo 

/ dve- ?, exp(-c~ 1 r~ 1 A Q -V(«)), 

JO 



which is the desired result. □ 

3 Number of distinct haplotypes 

3.1 The next branch with no extra mutation 

We let £ e denote the set of individuals who carry no more mutations than individual (some 
of and at most exactly the mutations carried by 0, but no other mutation). Set K® := and 
for % > 1, define Kf as the i-th individual in £ e , and Hf := H K e the associated branch length. 

We write H 9 in lieu of H® and we define the function Wg by 

P(H e >x) = — l — x > 0. 

Wg(x) 

Proposition 3.1 The bivariate sequence ((Kf — Kf_ ± , Hf);i > 1) is a sequence of i.i.d. ran- 
dom pairs. The function Wg is given by 



W e (x) = 1 + f W'(u)e~ 0u du x > 0. 
Jo 
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Remark 9 In the case when the coalescent process is derived from a splitting tree with lifespan 
measure A, the calculation of W$ is straightforward. Indeed, it can be seen in that case that 
the point process (Hf;i > 1) is the coalescent point process of the splitting tree obtained from 
the initial splitting tree with mutations after throwing away all points above a mutation. But 
this new tree is again a splitting tree, since lifespans are i.i.d. and terminate either at death 
time or at the first point mutation, so the lifespan measure is now Ag(dx) = e~ 0x A(dx) + 
9e~ 9x A((x, oo)) dx. As a consequence, We is here the scale function characterised as in ([3]) by 
its Laplace transform 



dxe~ Xx W e (x) = (A-/ A e {dx)(l - e~ Xx ) 

-l 



o 



H 

fx + O — A(dx)(l - e ~ {x+e)x ) 



A + 
A 
A + 



o 



A 

which yields the equality given in the statement. 



oo 

dxe- {x+e)x W{x) 



Proof. First observe that the pair (Kf , Hf) does not depend on the haplotype of individual 
0, and that the i-th individual with no mutation other than those carried by individual is 
also the next individual after Kf_ 1 with no mutation other than those carried by individual 
Kf_ 1 . This ensures that (Kf — Kf_^, Hf) has the same law as (Kf , Hf), and the independence 
between (Kf — Kf_ x , Hf) and previous pairs is due to the independence of branch lengths and 
the fact that new mutations can only occur on branches with labels strictly greater than Kf_ 1 . 

Now the event {H 9 G dx} can be decomposed according to: the value of H\\ conditional 
on H\ = z, the value of the age V z of the oldest mutation on H\ ; conditional on V z = y, the 
value H' y of the branch length associated with the first individual in £f with branch length 
greater than y. Indeed, H d G dx if: Hi G dx and there is no mutation in Hi (then K® = 1); or 
Hi G dx, the age of the oldest mutation on Hi = x isV x = y < x and the next individual with 
no mutation other than those carried by individual 1 and branch length H' y > y has H' y < x; 
or Hi = z < x, the age of the oldest mutation on Hi = z isV z = y < z and the next individual 
with no mutation other than those carried by individual 1 and branch length H' y > y has 
H' y G dx. 

/•X 

F(H G dx) = F(Hi G dx)e~ 0x + F(H ± G dx) / F(V X G dy)F(H' < x) 

Jo 

[ F(Hi G dz) [ F(V Z G dy)F(H' y G dx). 
Jo Jo 



+ 



Thanks to the first statement of the proposition, H' y has the same law as H e conditioned on 
being greater than y. Then since F(V Z G dy) = 6 e~ e<yZ ~ v ^ dy, we get 

F(H e G dx) = F(Hi G dx)(l - F(H 9 > x)f(x)) + F(H e G dx) [ F(Hi G dz)f(z), 

Jo 

where we have set 

f(x) := [ dy8e- 0{x - y) W e (y) x>0. 



o 
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We can drop the index 1 of H±, since only its law now matters. We can rewrite the last result 
as 

P(H e dx) = P(H 6 € dx)(l - / P(H € dz)f(z)) + P(H £ dx)P(H > x)f(x), 

Jo 

which can be integrated as 

t*x 

P(H >x)= P(H e > x)(l - / P(H £ dz)f(z)). 

Jo 

Defining now the function G as 

G(x) :=P(H>x)(W e (x)-f(x)), 
we get, thanks to the last integration, 

G(x) = 1 - f P(H G dz)f(z) - P(H > x)f(x). 
Jo 

Integrating by parts yields 

G(x) = l- dzP(H > z)f'(z) = 1- / dzP(H > z)(-6f(z) + 6W e (z)) = 1-9 dzG(z), 
Jo Jo Jo 

which shows that G(x) = e~ 6x . This reads 

W(x) = e 9x W e (x) -el dy e 9y W e (y). 
Jo 

One differentiation and one integration provide the result. □ 

3.2 Main result 
3.2.1 Statement 

Recall that A n (k) denotes the number of haplotypes carried by k individuals in a sample of n. 
Theorem 3.2 For all k > 1, the following convergence holds a.s. 

roc 1 / 1 \ fc-1 

lim n~ l A n (k) = / d x9e~ 9x -—^ 1 - -— 



In addition, 



lim n- l A n = r dx6 e~ ex —^— = E ( 1 - . 

n^oo J Q Wg(x) V / 



Before proving this statement, we insert a (sub)subsection in which we state and prove a 
preliminary key result. 
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3.2.2 The key lemma 

Recall that £n denotes the (time elapsed since the) i-th (most recent) mutation on the first 
branch length. In particular, the mutations carried by individual 1 and not by individual are 
exactly those £n such that £u < Hi (the other points of the process are thrown away). Let iVj 
denote the number of individuals whose most recent mutation is in. 

Lemma 3.3 In an infinite sample, for any integer k>\, 

Em-")- 1 ^^(i-siiJi) 

Proof. In the first place, not to care for the fact that only mutations with £n < H\ contribute, 
we consider the number N- of individuals whose most recent mutation is £qi , and we condition 
on £oj = Vj, j > 1. We will use later the fact that the law of iVj conditional on l\~ = Vj, j > 1, 
is that of N[1 Vi< h, where H is independent of N- and the point process (£oi; i > 1). 

Recall from the previous subsection that 6 9 is the set of individuals who carry no more 
mutations than individual 0, that Kf is the i-th. individual in £ e , and Hf := H K e. Then set 
Do := and 

A := inf{j > 1 : H? > v^} i > 1. 

Now observe that N- = Di — for all i > 1 (for N[, the count includes individual 0). As 
an application of Proposition 13.11 we get that conditional on £qj = Vj, j > 1 , 

p(iV{ = k) = ¥(H e < vif-^H 6 > Vl ), 

whereas for any i > 2, 

F(N-^ 0) = F(H < Vi | H e > vi-i) and P(iV- = k \ N- ^ 0) = P(H e < Vi) k - l F{H e > v t ). 

Recalling the relation between the laws of iVj and mentioned in the beginning of the proof, 
we get that conditional on £\j = Vj, j > 1, 

P(JVi = k) = F(H < vi) fc_1 P(i? e > tn)F(H > v%). 

whereas for any i > 2, 

F(Ni / 0) = F(H e <v l \H e > «i_i)P(fr > vi). 
Now F(Nl = k\N(^0)= F(Ni = k \ Ni + 0), so we finally get (for i > 2) 

F(Ni = k) = F(H e < v l ) k - 1 F(H 9 < Vi \H e > v l - 1 )F{H 9 > Vl )F(H > Vl ) 

i V- 1 L _ w e (vj-i) \ i 
We(vi)J V Wg( Vi ) J W{vi)We{vi)' 

It is well-known that for the Poisson point process of mutations, 

6 i x i ~ 2 

P(4,i-i G dx, £\i £ dz) = — — - e~ dz dxdz < x < z, i > 2, 
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so that 

Y J ^(N i = k) = I dz I dx 



i>2 



i>2 ' 



o (i-2)! 



dzO e 



-Oz 



o 



1 



1 



fe-i 



W e (z) 



1 



fc-i 



W e (z)J W(z)W e (z)J 



dx6e dx I 1 



W e (x) \ 1 
W e {x) 



W e (z) 



Now thanks to Proposition 13.14 we can perform the following integration by parts on the last 
integral in the last display 



dx 6 e 9x I 1 



W e {x) 
W e {z) 



e dx I 1 



W e (x) 
We(z) 



+ 



1 



dxe 6x W e (x) 



1 1 
+ — — + 



W (z) W e (z) J 



o W e (z) J 
dxW'{x) 



W(z) 
W e (z) 



1. 



This entails 



f°° / 1 



fc-i 



1 



W(g) 



i . 



But since 



P(iVi = k) 



dzde~ ez ( 1 



1 \ k 1 1 



the result follows. 



□ 



3.2.3 Proof of Theorem EOl 

For each individual i > 0, we denote by .Ay the set of individuals bearing the unique haplotype 
whose most recent mutation is £{j. In particular, it is understood that Ay = whenever 
iij > Hi (because no such haplotype exists). 

Now fix M > 1 . Similarly as in the proof of Theorem 12.31 we can define 

G M (Ei) := Card {j > 1 : Card Ay n {£,..., i + M} > k}, 

where 

Observe that Gm is bounded from above, so that GmC^i) is integrable for all i > 0. Now for 
any < r < M, the random variables Gkf(Ej), for i such that i = r [M + 1] (standing for mod 
(M + 1)), are i.i.d. and integrable, so by the strong law of large numbers, we have the following 
convergence a.s. (and in L 1 ) 

lim n- 1 Yl G ™&) = TTZT HGm^i))- 

n— >oo * — • 1V1 + 1 

0<i=r[M+l]<n-M 



21 



Summing over r these M + 1 equalities, we get the following convergence a.s. (and in L 1 ) 

n-M 

lim n" 1 V G M (Ei) = E(G M (H 1 )). 

n— >oo » 
i=0 

Our goal is now to let M — > oo. First define 

n 

A' n (k) := ^ Card {j > 1 : Card Aj D {i, . . . , n} > fe}. 

i=0 

Notice that 

A! n {k) = Y J K{h). 

h>k 

Then for any i = 0, . . . , n — M, for any j > 1, if Card A%j D {«,...,« + M} > k, then 
Card n {i, . . . , n} > k, so that A' n (k) > Y™~™ G M (^i), and 

n-M 

limmfn" 1 ^^) > liminf n" 1 V G M (Hi) = E(G M (Si)). 

n— >oo n— >oo ^— ' 

i=l 

Letting M — > oo, Beppo Levi's theorem yields 

liminf n^A^ik) > E [Card {j > 1 : Card Aj > fc}] = J^P(Card Aj > k) =: y k . 

n ^°° j>i 

In the notation of the previous subsection Card A\j = Nj, so by Fubini-Tonelli's theorem, 
y k = Y, ¥ ( N J > k ) =EE P ^ = h ) = E^' 

j>l j>l h>k h>k 

where x k ■= Yl^i^i-Nj = k). Thanks to Lemma 13.31 we have the following explicit expression 
for x k 

poo 1 / I \ fc-1 

Xk = / e e- 6z dz -—-r I 1 



l ' W e {zf V W e (z) 

Now recall that Y^k>i ^l(^) = E/i>i^nW = n - Since it is easily seen that Y2k>iVk = 
z\^>i /ix/j = 1, by Fatou's lemma 

1 = y^ yfc < lim inf n~ 1 ( < liminf n _1 V A' (Jfe) = 1. 

Z — ✓ Z — ✓ n n Z — / 

fe>l fc>l fe>l 

Then we would get a contradiction if there was &o such that liminf n n _1 j4^(/co) > Uk , so that 
for all k > 1 a.s., 

lim n _1 ^(^) = 

The first equation of the theorem stems from the fact that A n (k) = A' n (k) — A' n (k + 1) and the 
second one by taking k = 1 in the last display. It takes an elementary integration by parts to 
check that 

roc -1 

Vi= dxee- e *-?—=E(l-e~ eH0 
Jo Wg[x) 



22 



Acknowledgments. This work was partially funded by the project MAEV 'Modeles Aleatoires 
de l'Bvolution du Vivant' of ANR (French national research agency). 

References 

[1] Abraham, R., Delmas, J.F. (2008) 

Williams' decomposition of the Levy continuous random tree and simultaneous 
extinction probability for populations with neutral mutations. Stock. Proc. Appl. 
doi:10.1016/j.spa.2008.06.001 

[2] Aldous, D., Popovic, L. (2005) 

A critical branching process model for biodiversity. Adv. Appl. Probab. 37 1094-1115. 

[3] Arratia, R., Barbour, A.D., Tavare, S. (1992) 

Poisson process approximations for the Ewens sampling formula. Ann. Appl. Probab. 2 
519-535. 

[4] Basdevant, A.L., Goldschmidt, C. (2008) 

Asymptotics of the allele frequency spectrum associated with the Bolthausen-Sznitman 
coalescent. Preprint arXiv:0706.2808vl 

[5] Berestycki, J., Berestycki, N., Schweinsberg, J. (2007) 

Beta-coalescents and continuous stable random trees. Ann. Probab. 35 1835-1887. 

[6] Bertoin, J. (1996) 

Levy processes. Cambridge University Press, Cambridge. 

[7] Bertoin, J. (2008) 

The structure of the allelic partition of the total population for Galton- Watson processes 
with neutral mutations. Preprint arXiv:0711.3852 

[8] Donnelly, P., Tavare, S. (1986) 

The ages of alleles and a coalescent. Adv. Appl. Probab. 18 1-19. 

[9] Durrett, R. (2008) 

Probability Models for DNA Sequence Evolution. Springer- Verlag, Berlin. 2nd revised ed. 

[10] Ewens, W.J. (2005) 

Mathematical Population Genetics. 2nd edition, Springer- Verlag, Berlin. 

[11] Fisher, R.A., Corbet, S.A., Williams, C.B. (1943) 

The relation between the number of species and the number of individuals in a random 
sample of an animal population. J. Anim. Ecol. 12 42-58. 

[12] Geiger, J., Kersting, G. (1997) 

Depth-first search of random trees, and Poisson point processes, in Classical and modern 
branching processes (Minneapolis, 1994) IMA Math. Appl. Vol. 84. Springer- Verlag, New 
York. 



23 



[13] Kingman, J.F.C. (1982) 

The coalescent. Stochastic Process. Appl. 13 235-248. 

[14] Lambert, A. (2008) 

The contour of splitting trees is a Levy process. Preprint arXiv:0704.3098vl 

[15] Lambert, A. (2008) Population Dynamics and Random Genealogies. Stock. Models 24 
45-163. 

[16] Mohle, M. (2006) 

On the number of segregating sites for populations with large family sizes. Adv. Appl. 
Prob. 38 750-767. 

[17] Popovic, L. (2004) 

Asymptotic genealogy of a critical branching process. Ann. Appl. Prob. 14 2120-2148. 



24 



